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Abstract. Recent results and interpretations are presented for the thermal minority 
game, concentrating on deriving and justifying the fundamental stochastic differential 
equation for the microdynamics. 



Market economics poses several problems of potential interest and challenge to 
statistical physics, involving the co-operative behaviour of many agents whose ac- 
tions involve mutual frustration and disorder, both quenched and stochastic. In 
a nutshell, speculators in an idealized stock market are made up of buyers and 
sellers, each having personal gain as their objectives, trying to buy low and sell 
high, making their decisions based on commonly available information using indi- 
vidual strategies, with their collective actions determining the (time- varying) 'right 
choices' and learning from experience. From the point of view of the market regu- 
lator, however, preference is for low volatility and market efficiency 

The minority game (MG) is a simple encapsulation of some of the ingredients 
and issues of a market. It consists of iV agents each of whom at each step of a 
parallel dynamical process makes either of two choices, with the objective of being 
in the minority overall. The agents have no direct knowledge of one another and 
make their decisions based on purely global information /(£), available equally to all. 
Their decisions are determined through the application to I(t) of individual strategy 
functions, each agent having a small number of such strategies, drawn randomly 
and independently from a large distribution at the outset and fixed throughout the 
game. At each time-step each agent employs (just) one of his or her strategies. 
Adaptation occurs through the development of functions which determine their 
choices of strategy. 

In the original formulation [1] the information I(t) was the minority choice over 
the last m time-steps and the adaptation was achieved through the cumulative 
award of points at each time-step to the strategies which would have yielded the 
actual minority choice at that step. The strategy played by any agent at any time- 
step was that of his/her strategies which currently had the largest point-score. 

A remarkable observation in simulations [2] was that the variance in the minority 



choice became smaller than that of random choice for large enough m, indicating 
correlation of the agents' actions. A critical memory length m c was observed for 
minimum variance, with agents appearing to be frozen in their choices for m > m c , 
non-frozen for m < m c . Moreover, it was shown that the dependence on m was 
through the scaling variable D/N, where D = 2 m was the dimension of the space 
of strategies [2]. Further simulations showed (i) these results are unaffected by re- 
placing the true history by a random I(t) [3], indicating that as far as macroscopic 
observables are concerned the 'information' merely effectuates the correlation; (ii) 
replacing the deterministic strategy-choices by stochastic ones can significantly re- 
duce the volatility for information vectors of less than the critical length [4]. 

Here we consider the determination of a fundamental analytic theory and report 
the derivation of the underlying stochastic differential equation for the microdynam- 
ics [5]. We concentrate on a continuous formulation in which I(t) is a stochastically 
randomly chosen unit-length vector on a D-dimensional hypersphere, the strategies 
are quenched random vectors of length \f~D in the same space, Rf,i = 1, . . . ,N 
labeling the agents and the a = l,...,s their strategies. The analogues of the 
binary choices above are bids b?(t) = Rf ■ I(t). The strategies which are actually 
used are indicated by R*(t). The total bid at time t is A(t) = 52iR*(t) ■ I(t). The 
point update rule is 

p?(t+i) = mt)-%(t)Mt)/N. (i) 

For simplicity we specialize to s = 2 and define 

g = (i5-i?)/2, u t = (Rl + R*)/2; Pi (t) = Pl(t) - Pf{t) (2) 

In a generalized thermal minority game (TMG) the probability of strategy use is 

^(O^fl + exp^/^^)))]- 1 (3) 

and it is useful to define a 'spin' 

8i (t) = nl(t) - 7rf(t) = tanh(/3/(p^))). (4) 

In [4] the choice f(p) = p was employed, but here we consider f(p) = sgn(p) = z 
[5]. ' 

We are interested in coarse-grained average behaviour on a time-scale greater 
than the step-length in order to pass to a continuum-time theory. Equivalently, we 
take a time-scale At with I(t) a differential random noise I(t) = AW(t) with zero 
mean and variance At. In the limit At — > oo a Kramers-Moyal expansion yields [5] 

d Pi (t) = -(ND)' 1 ^^) ■ iidt + 0(dt 2 ) (5) 



so that to 0(dt) the information noise has been eliminated in favour of an effective 
interaction between the agents and the averaged variance becomes 



a 2 ee N-\A(ty) = (ND)- 1 J2(R*(t) ■ R*(t)), 

ij 



(6) 



where the (•) refer to a temporal average and the bar to an average over the 
quenched disorder of the strategies. 

At T = Eqs. (2) are deterministic and to leading order in dt reduce to 

dp/dt = -VsH\ s =z; P = (pi, -Pn) (7) 



H = Y^hiSi + JijSiSj, (8) 

i i^j 



h^iNDy^Uj-gi, J ij = (ND)- 1 ( i -( j (9) 
j 

At finite temperature correlations between the fluctuations of the right hand 
sides of Eqs. (5) are of the same order as the mean and Eq. (7) must be replaced 
by a set of stochastic differential equations [5] 

dp = -V s Udt + M ■ dW (10) 

where M. = {Mij} is the covariance matrix 

MiMt)] = Y J ikMi-4(t)} (ii) 

K 

and W(t) is an A^-dimensional Wiener process of unit scale; for T = Sk(t) 2 = 1 
so the Wiener term has no weight. Correspondingly, the Fokker-Planck equation 
for the probability distribution of the Pi is 

§-?!-(f p H5i«^ (12) 

The average volatility is given by 



a 2 ee AT 1 ^) 2 ) = I + 2(H) (13) 

Eq. (10) is thus the fundamental microscopic equation from which the macrody- 
namics should be calculable. To check this we have compared numerical evaluations 
of the volatility and the density of frozen agents (those for whom Pi(t) does not 
change sign after initial transient) from Eqs. (7) and (10) with corresponding di- 
rect simulations from Eqs. (1) and (3). They are in perfect accord. This is shown 
explicitly in Fig. 1 for T = 0. Figs. 2a and 2b show the effect of temperature as 
given by Eq. (10); direct simulation gave results identical within statistical error. 




FIGURE 1. Volatility a as a function of the reduced dimension d = D/N. Squares correspond 
to the original dynamics of Eq. (1), circles to Eq. (7); T — and Pi(Q) ~ 0. Diamonds correspond 
to minimization of 7i. Inset: fraction of frozen agents. 



In [6] (Ti) was evaluated for T = on the assumption that the system equilibrated 
and hence was equivalent to minimizing Ti. The result is also exhibited in Fig. 1 
and can be seen to be good (and probably correct) for d = D/N > d c but in error 
for d < d c . In fact, however, Eq. (7) does not describe a simple descent dynamics 
since the variables on the right and left hand sides are different and a metric is 
needed to relate p and s. Substitution shows that the dynamics in non-Markovian 
in s. An explicit demonstration of non-equilibration for d < d c follows from a 
simulation starting with |pi(0)| ~ 0(1) ^> dt, where dt is the time-step. This is 
illustrated in Fig. 3a [5]. 

It is also tempting to compare with a Hopfield neural network which is charac- 
terizable by an effective Hamiltonian 

-i p=aN 

H = -nT, J*W, J% = E # • $ (14) 

where the {£f } are quenched random patterns; indeed it was by the application 
of techniques devised for (14) that [6] minimized H. Clearly there is a difference 
of sign between Eqs. (8) and (14) but one might be tempted to anticipate that 
this will merely suppress the retrieval attractors while maintaining the spin glass 
state, in analogy with the SK model, and then attribute the reduction in energy 
compared with the random state to spin-glass binding. However, this is false; the 
Hopfield spin-glass solution is not symmetric under change of sign of (3. Rather, 
the random-field term of Eq. (8) is crucial in reducing the ground state energy of H 
and the volatility below their random-state values. This is demonstrated explicitly 
in Fig. 3b which shows the effect of choosing R\ randomly but Rj as 



FIGURE 2. (a) Volatility as a function of the temperature from the continuous dynamics Eq. 
(10);/(p) = sgn(p), pi(0) - 0. Inset: volatility as a function of d for T = 1(T 3 , 1, 2, and 10. (b) 
Fraction of frozen agents as a function of T. Inset: schematic cross-over phase diagram. 



R\ = -(1 - X)Rl + XRi 



(15) 



where Ri is also chosen randomly [10,11]. For A = 0, Ui = 0, hi = and the volatility 
never falls below random, although again there appears a (different) critical d c 
separating regimes (worse-than-random and random). 

As noted, above we have used f(p) = sgn(p) in the simulations. If instead 
fip) = P is employed, then for d > d c the system iterates over a long time to 
its zero-temperature behaviour [7] since the mean \pi(t)\ grows quasi-continuously 
and Si(t) saturates to its zero-temperature value, which being ±1 eliminates the 
effects of the second term of Eq. (10). However, for d < d c there continues to be an 
improvement with temperature, to an optimal value which is better than random 
and is reached at a temperature of 0(1) [8], but without any further rise to the 
random value; in this case Pi(t) fluctuates around p,(0). 

Finally we remark on the relationship with the crowd-anticrowd concept of [9], 
where a crowd is a group of agents playing the same strategy and the corresponding 
anticrowd play the opposite strategy. From Eq. (6) 



where e M is a unit vector in the // Cartesian direction of the D-dimensional space. 
rifj, then formalizes the notion of the number of agents in crowd \x minus the number 
in the corresponding anti-crowd. The qualitative difference of d < d c and d > d c 
then follows from the recognition that for d d c the vectors Rf are densely 
distributed on the D-sphere permitting n u ~ O(N), while for d ^> d c they are 
sparsely distributed so all n u ~ 0(1). 



(16) 





FIGURE 3. (a) Volatility as a function of d for random initial conditions |pi(0)| ~ 0(1) S> dt 
for original dynamics and from Eq. (10). (b) Volatility for partially anticorrelated Rl' 2 ; 
Rf = -(1 - X)Rj + XRi; Rj,Ri random. 
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